IBM SAN Volume Controller
   HOME

TheInfoList



OR:

The IBM SAN Volume Controller (SVC) is a block
storage virtualization In computer science, storage virtualization is "the process of presenting a logical view of the physical storage resources to" a host computer system, "treating all storage media (hard disk, optical disk, tape, etc.) in the enterprise as a singl ...
appliance that belongs to the IBM System Storage product family. SVC implements an indirection, or "virtualization", layer in a
Fibre Channel Fibre Channel (FC) is a high-speed data transfer protocol providing in-order, lossless delivery of raw block data. Fibre Channel is primarily used to connect computer data storage to servers in storage area networks (SAN) in commercial data cen ...
storage area network A storage area network (SAN) or storage network is a computer network which provides access to consolidated, block-level data storage. SANs are primarily used to access data storage devices, such as disk arrays and tape libraries from serve ...
(SAN).


Architecture

The IBM 2145 SAN Volume Controller (SVC) is an inline virtualization or "gateway" device. It logically sits between hosts and storage arrays, presenting itself to hosts as the storage provider (target) and presenting itself to storage arrays as one big host. SVC is physically attached to one or several SAN fabrics. The virtualization approach allows for non-disruptive replacements of any part in the storage infrastructure, including the SVC devices themselves. It also aims at simplifying compatibility requirements in strongly heterogeneous server and storage landscapes. All advanced functions are therefore implemented in the virtualization layer, which allows switching storage array vendors without impact. Finally, spreading an SVC installation across two or more sites (''stretched clustering'') enables basic disaster protection paired with continuous availability. SVC nodes are always clustered, with a minimum of 2 and a maximum of 8 nodes, and linear scalability. Nodes are
rack Rack or racks may refer to: Storage and installation * Amp rack, short for amplifier rack, a piece of furniture in which amplifiers are mounted * Bicycle rack, a frame for storing bicycles when not in use * Bustle rack, a type of storage bi ...
-mounted appliances derived from
IBM System x System x is a line of x86 servers produced by IBM – and later by Lenovo – as a sub-brand of IBM's ''System'' brand, alongside IBM Power Systems, IBM System z and IBM System Storage. In addition, IBM System x was the main component of the ...
servers, protected by redundant power supplies and integrated batteries. Earlier models featured external battery-backed power supplies. Each node has Fibre Channel ports simultaneously used for incoming, outgoing, and intracluster data traffic. Hosts may also be attached via FCoE and iSCSI Gbit Ethernet ports. Intracluster communication includes maintaining read/write cache integrity, sharing status information, and forwarding reads and writes to any port. These ports must be zoned together. Write
cache Cache, caching, or caché may refer to: Places United States * Cache, Idaho, an unincorporated community * Cache, Illinois, an unincorporated community * Cache, Oklahoma, a city in Comanche County * Cache, Utah, Cache County, Utah * Cache County ...
is protected by mirroring within a pair of SVC nodes, called ''I/O group''. Virtualized resources (= storage volumes presented to hosts) are distributed across I/O groups to improve performance. Volumes can also be moved nondisruptively between I/O groups, e.g., when new node pairs are added or older technology is removed. Node pairs are always active, meaning both members accept simultaneous writes for each volume. In addition, all other cluster nodes accept and forward read and write requests which are internally handled by the appropriate I/O group. Path or board failures are compensated by non-disruptive failover within each I/O group, or optionally across dispersed I/O groups. Hosts must have multipath drivers installed, such as IBM Subsystem Device Driver (SDD) or standard
MPIO MPIO or Mpio ( Hangul: 엠피오) was a South Korean consumer electronics brand and company. Established in 1998 by Woo Jung-Ku, it was one of Korea's early manufacturers of MP3 players and was known for producing portable digital audio players, ...
drivers. SVC is based on COMmodity PArts Storage System (''Compass'') architecture, developed at the IBM
Almaden Research Center IBM Research is the research and development division for IBM, an American multinational information technology company headquartered in Armonk, New York, with operations in over 170 countries. IBM Research is the largest industrial research org ...
."IBM System Storage SAN Volume Controller", IBM's Redbook SG24-6423-05, pp. 12. The majority of the software has been developed at the IBM Hursley Labs in the UK.


Terminology

* Node - a single 1U or 2U machine. * I/O group - a pair of nodes that duplicate each other's write commands * Cluster - a group of 1 to 4 I/O groups managed as a single entity. ** Stretched cluster - a site protection configuration with 1 to 4 I/O groups, each stretched across two sites, plus a witness site ** Cluster IP address - a single
IP address An Internet Protocol address (IP address) is a numerical label such as that is connected to a computer network that uses the Internet Protocol for communication.. Updated by . An IP address serves two main functions: network interface ident ...
of a cluster that provides administrative interfaces via (
SSH The Secure Shell Protocol (SSH) is a cryptographic network protocol for operating network services securely over an unsecured network. Its most notable applications are remote login and command-line execution. SSH applications are based on ...
and
HTTPS Hypertext Transfer Protocol Secure (HTTPS) is an extension of the Hypertext Transfer Protocol (HTTP). It is used for secure communication over a computer network, and is widely used on the Internet. In HTTPS, the communication protocol is enc ...
) ** Service IP address - an IP address used to service an individual node. Each node can have a service IP configured. ** Configuration node - a single node that holds the cluster's configuration and has the assigned cluster IP address. * Master Console (or SSPC) - a management GUI for SVC until rel 5.1, based on
WebSphere Application Server WebSphere Application Server (WAS) is a software product that performs the role of a web application server. More specifically, it is a software framework and middleware that hosts Java-based web applications. It is the flagship product with ...
; not installed on any SVC node, but on a separate machine ** As of SVC rel 6.1, a Master Console (SSPC) is no longer used. Web based administration is done directly on the configuration node, using a
HTML5 HTML5 is a markup language used for structuring and presenting content on the World Wide Web. It is the fifth and final major HTML version that is a World Wide Web Consortium (W3C) recommendation. The current specification is known as the HTML ...
GUI. * Virtual Disk (
VDisk A logical disk, logical volume or virtual disk (VD or vdisk for short) is a virtual device that provides an area of usable storage capacity on one or more physical disk drive(s) in a computer system. The disk is described as ''logical'' or ''virtu ...
) - a unit of storage presented to the host. The release 6 GUI refers to a VDisk as a Volume. * Managed Disk (MDisk) - a unit of storage (a LUN) from a real, external
disk array A disk array is a disk storage system which contains multiple disk drives. It is differentiated from a disk enclosure, in that an array has cache memory and advanced functionality, like RAID, deduplication, encryption and virtualization. Compo ...
, virtualized by the SVC. An MDisk is the base to create an image mode VDisk. * Managed Disk Group - (MDisk Group) a group of one or more Mdisks. The extents of the MDisks in an MDisk Group are the base to create a striped or sequential mode VDisk. The release 6 GUI refers to a Managed Disk Group as a Pool. * Extent - a discrete unit of storage; an MDisk is divided into extents; a VDisk is formed from set of extents.


Timeline

The different SAN Volume Controller models were available for purchase shortly after the mentioned announcement day. The light green bars show the period of time when each model could be ordered, while the light blue bars show how long the standard service was continued after withdrawal from marketing. The displayed information is current in August 2019. There are differences in service conditions between 2145 and 2147, but not in hardware.


Performance

Release 4.3 of the SVC held the
Storage Performance Council Storage may refer to: Goods Containers * Dry cask storage, for storing high-level radioactive waste * Food storage * Intermodal container, cargo shipping * Storage tank Facilities * Garage (residential), a storage space normally used to store ...
(SPC) world record for SPC-1 performance benchmarks, returning nearly 275K (274,997.58)
IOPS Input/output operations per second (IOPS, pronounced ''eye-ops'') is an input/output performance measurement used to characterize computer storage devices like hard disk drives (HDD), solid state drives (SSD), and storage area networks (SAN). Lik ...
. There was no faster storage subsystem benchmarked by the SPC at that time (October 2008). The SPC-2 benchmark also returned a world leading measurement of over 7 GB/s throughput. Release 5.1 achieved new records with a 4 node and 6 node cluster benchmark with DS8700 as backed storage device. SVC broke its own record of 274,997.58 SPC-1 IOPS in March 2010, with 315,043.59 for the 4 node cluster and 380,489.30 with the 6 node cluster, records that stood until October 2011. Release 6.2 of the SVC held the
Storage Performance Council Storage may refer to: Goods Containers * Dry cask storage, for storing high-level radioactive waste * Food storage * Intermodal container, cargo shipping * Storage tank Facilities * Garage (residential), a storage space normally used to store ...
(SPC) world record for SPC-1 performance benchmarks, returning over 500K (520,043.99) IOPS (I/Os per second) using 8 SVC nodes and Storwize V7000 as the backend disk. There was no faster storage subsystem benchmarked by the SPC at that time (January 2012). The full results and executive summaries can be reviewed at the SPC website referenced above."Cache hit" or "bandwidth" performance numbers are usually much higher, e.g. "20 GBPS", but are relatively meaningless as they cannot be achieved in real-word scenarios. Release 7.x provides multiple enhancements including support for additional CPUs, cache and adapters. The streamlined cache operates at 100µs fall-through latency and 60 µs cache-hit latency, enabling SVC as a front-end to
IBM FlashSystem IBM FlashSystem is an IBM Storage enterprise system that stores data on flash memory. Unlike storage systems that use standard solid-state drives, IBM FlashSystem products incorporate custom hardware based on technology from the 2012 IBM acquis ...
solid-state storage without significant performance penalty. (See also: FlashSystem V9000).


Included Features (7.x)

; Indirection or mapping from virtual LUN to physical LUN : Servers access SVC as if it were a storage controller. The SCSI LUNs they see represent virtual disks (volumes) allocated in SVC from a pool of storage made up from one or more managed disks (MDisks). A managed disk is simply a storage LUN provided by one of the storage controllers that SVC is virtualizing. The virtual capacity can be larger than the managed physical capacity, with a current maximum of 32 PB, depending on management granularity (''extent size'') ; Data migration and pooling : SVC can move volumes from one capacity pool (''MDisk group'') to another whilst maintaining I/O access to the data. Write and read caching remain active. Pools can be shrunk or expanded by removing or adding hardware capacity, while maintaining I/O access to the data. Both features can be used for seamless hardware migration. Migration from an old SVC model to the most recent model is also seamless and implies no copying of data. ; Importing and exporting existing LUNs via Image Mode : "Image mode" is a non-virtualized pass-through representation of an MDisk (managed LUN) that contains existing client data; such an MDisk can be seamlessly imported into or removed from an SVC cluster. ; Fast-write cache : Writes from hosts are acknowledged once they have been committed into the SVC mirrored cache, but prior to being destaged to the underlying storage controllers. Data is protected by replication to the peer node in an I/O group (cluster node pair). Cache size is dependent on the SVC hardware model and installed options. Fast-write cache is especially useful to increase performance in midrange storage configurations. ; Auto tiering (Easy Tier) : SVC automatically selects the best storage hardware for each chunk of data, according to its access patterns. Cache unfriendly "hot" data is dynamically moved to solid state drives ''SSD'', whereas cache friendly data as well as "cold" data is moved to economic spinning disks. Easy Tier also monitors and optimizes spindle-only workloads if no solid state storage is attached. Idem, Easy Tier automatically optimizes solid-state workloads between Enterprise- and Read Intensive Flash media. ; Solid state drive (SSD) capability : SVC can use any supported external SSD storage device or provide its own internal SSD slots, up to 32 per cluster. These can be used to boost aging spinning disk pools: Easy Tiering is automatically active in hybrid mixed-media capacity pools. ; Thin Provisioning : LUN capacity is only used when new data is written to a LUN. Data blocks equal zero are not physically allocated, unless previous data unequal zero exists. During import or during internal migrations, data blocks equal zero are discarded (''Thick-to-thin migration''). : Besides, thin provisioning is integrated in the FlashCopy features detailed below to provide space-efficient snapshots ; Virtual Disk Mirroring : Provides the ability to maintain two redundant copies of a LUN, implicitly on different storage controllers ; Site protection with ''Stretched Cluster'' : A geographically distributed, highly available clustered storage setup leveraging the virtual disk mirroring feature across datacenters within 300 km distance. Stretched Clusters can span 2, 3 or 4 datacenters (chain or ring topology, a 4-site cluster requiring 8 cluster nodes). Cluster consistency is ensured by a majority voting set. : From two storage devices in two datacenters, SVC presents one common logical instance. User-side operations like Snapshot or LUN Resizing apply at the logical instance level. Hardware-oriented operations like real-time compression or live hardware migration occur at the physical instance level. : Unlike in classical mirroring, logical LUNs are readable and writable on both sides (''tandem'') at the same time, removing the need for ''failover'', ''role switch'', or ''site switch'' as found in Site Recovery managing products. The feature can be combined with
Live Partition Mobility Live Partition Mobility is a chargeable Live migration feature of IBM POWER6, POWER7, POWER8 and POWER9 servers, available since 2007, that allows a running LPAR to be relocated from one system to another. In concept, it is similar to VMware VMot ...
or
VMotion VMware, Inc. is an American cloud computing and virtualization technology company with headquarters in Palo Alto, California. VMware was the first commercially successful company to virtualize the x86 architecture. VMware's desktop software ru ...
to avoid bulk data transport during a metro-distance virtual server motion. ; Geographical crossover access : All SVC cluster nodes in a Stretched Cluster have read/write access to storage hardware in the mirror location, removing the need for site-resynchronization in case of single node failures. This feature is mutually exclusive with ''Enhanced Stretched Cluster'', and only recommended for single stretched node pairs. ; Hot Standby Nodes : Powered nodes that can take over the role of failed nodes in a stretched or local cluster at very short notice. ; Enhanced Stretched Cluster : A functionality optimizing data paths within a metro- or geo-distance Stretched Cluster (see above), helpful when bandwidth between sites is scarce and cross-site traffic must be minimized. SVC will attempt to use the shortest path for reads and writes. For instance, cache write destaging to storage devices is always performed by the most nearby cache copy, unless its peer cache copy is down. Two node pairs are the recommended minimum for an Enhanced Stretched Cluster. ; Stretched Cluster with Golden Copy (3-site DR) : A Stretched Cluster that maintains an additional synchronous or asynchronous data copy on an independent Stretched Cluster or SVC or Storwize device at geo distances. The ''Golden Copy'' is a disaster protection against metro-scale outages impacting the Stretched Cluster as a whole. It relies on licensed Metro- or Global Mirror functionality. ; Hyperswap : The ability to seamlessly fail over data access between geographically dispersed IO groups or clusters. As with ''Stretched Cluster'', both sides accept simultaneous writes, but write cache data is mirrored locally in both sites as IO groups are kept together. Hyperswap can be combined with
Live Partition Mobility Live Partition Mobility is a chargeable Live migration feature of IBM POWER6, POWER7, POWER8 and POWER9 servers, available since 2007, that allows a running LPAR to be relocated from one system to another. In concept, it is similar to VMware VMot ...
or
VMotion VMware, Inc. is an American cloud computing and virtualization technology company with headquarters in Palo Alto, California. VMware was the first commercially successful company to virtualize the x86 architecture. VMware's desktop software ru ...
for maximized application availability. On the server side, Hyperswap works with most native multipath drivers with ''ALUA'' support. Hyperswap relies on Metro Mirror functionality and requires a Metro Mirror license, as well as a minimum of two node pairs. ; Transparent Cloud Tiering : Swift- and S3-compatible object datastores can be used as a cold tier for incremental volume snapshots and volume archives without live production access. This allows keeping hourly ''time machine'' copies or archiving VM images including attached volumes at a price point somewhat closer to tape media. On-premise datastore support is provided via OpenStack Swift. Off-premise datastore support is provided by Amazon S3 or Softlayer. Off-premise Transparent Cloud Tiering per default uses AES encryption, which is a licensed feature.


Optional features

There are some optional features, separately licensed e.g. per TB: ; Real-Time Compression : This in-flight data reduction technology offers a footprint reduction of 50% (guaranteed) or up to 80% (found in Oracle databases). Leveraging dedicated compression hardware, it has generally no performance impact and is usable for heavy duty databases. The
temporal locality In computer science, locality of reference, also known as the principle of locality, is the tendency of a processor to access the same set of memory locations repetitively over a short period of time. There are two basic types of reference localit ...
of the algorithm can even ''increase'' the read performance on adequate data patterns such as SQL databases stored on spinning disks. The compression efficiency is equal to "zip" (
Lempel–Ziv–Welch Lempel–Ziv–Welch (LZW) is a universal lossless data compression algorithm created by Abraham Lempel, Jacob Ziv, and Terry Welch. It was published by Welch in 1984 as an improved implementation of the LZ78 algorithm published by Lempe ...
) with a very large dictionary, and can accurately be predicted across Petabytes using the ''Comprestimator'' tool. :Real-Time Compression can be combined with Easy Tiering, Thin Provisioning and Virtual Disk Mirroring. It was initially invented by the acquired startup Storwize Inc., which also served as new name for the SVC-derived IBM storage systems family. ;
FlashCopy FlashCopy is an IBM feature supported on various IBM storage devices that made it possible to create, nearly instantaneously, point-in-time snapshot copies of entire logical volumes or data sets. The Hitachi Data Systems Hitachi Data Systems ...
(Snapshot) : This is used to create a
disk snapshot In computer systems, a snapshot is the state of a system at a particular point in time. The term was coined as an analogy to that in photography. It can refer to an actual copy of the state of a system or to a capability provided by certain syste ...
for backup/rollback or application testing of a single volume. Snapshots require only the "delta" capacity unless created with full-provisioned target volumes. FlashCopy comes in three flavours: Snapshot, Backup volume, and Clone, which is automatically unlinked from its source. All are based on optimized
copy-on-write Copy-on-write (COW), sometimes referred to as implicit sharing or shadowing, is a resource-management technique used in computer programming to efficiently implement a "duplicate" or "copy" operation on modifiable resources. If a resource is dupl ...
technology. :One source volume can have up to 256 simultaneous targets. Targets can be made incremental, and cascaded tree like dependency structures can be constructed. Targets can be re-applied to their source or any other appropriate volume, also of different size (e.g. resetting any changes from a resize command). :Copy-on-write is based on a
bitmap In computing, a bitmap is a mapping from some domain (for example, a range of integers) to bits. It is also called a bit array A bit array (also known as bitmask, bit map, bit set, bit string, or bit vector) is an array data structure that c ...
with a configurable ''grain'' size, as opposed to a
journal A journal, from the Old French ''journal'' (meaning "daily"), may refer to: *Bullet journal, a method of personal organization *Diary, a record of what happened over the course of a day or other period *Daybook, also known as a general journal, a ...
. ; FlashCopy rollback ''(time machine)'' : Provides a time-machine inspired rollback capability using selectively granular consistency points. The consistency mechanism can cover many LUNs at a time. Rollback requires a FlashCopy license and the Spectrum Control Snapshot software. ;
Metro Mirror Peer to Peer Remote Copy or PPRC is a protocol to replicate a storage volume to another control unit in a remote site. Synchronous PPRC causes each write to the primary volume to be performed to the secondary as well, and the I/O is only considere ...
- synchronous remote replication : This allows a remote disaster recovery site at a distance of up to about 300km ;
Global Mirror Global Mirror is an IBM technology that provides data replication over extended distances between two sites for business continuity and disaster recovery. If adequate bandwidth exists, Global Mirror provides a recovery point objective (RPO) of as ...
- asynchronous remote replication : This allows a remote disaster recovery site at a distance of thousands of kilometres. Each Global Mirror relationship can be configured for high latency / low bandwidth or for high latency / high bandwidth connectivity, the latter allowing a consistent recovery point objective ''RPO'' below 1 sec. ;
Global Mirror Global Mirror is an IBM technology that provides data replication over extended distances between two sites for business continuity and disaster recovery. If adequate bandwidth exists, Global Mirror provides a recovery point objective (RPO) of as ...
over IP - remote replication over the Internet : uses ''SANslide'' technology integrated into the SVC firmware to send mirroring data traffic across a TCP/IP link, while maximizing the bandwidth efficiency of that link. This may result in a 100x data transfer acceleration over long distances. ;
Encryption In cryptography, encryption is the process of encoding information. This process converts the original representation of the information, known as plaintext, into an alternative form known as ciphertext. Ideally, only authorized parties can decip ...
of data at rest : SVC and other ''Spectrum Virtualize''-based devices can transparently encrypt data on any local media, virtualized attached storage, or cloud tier (per default). The encryption mechanism is 256-bit AES-XTS. Keys are either generated locally and stored on removable thumb drives or obtained from a key lifecycle management service. Both options are mutually exclusive.


Other products running SVC code

On 7 October 2010, IBM announced the
IBM Storwize V7000 IBM Storwize systems were virtualizing RAID computer data storage systems with raw storage capacities up to 32 PB. Storwize is based on the same software as IBM SAN Volume Controller (SVC). Formerly Storwize was an independent data storage organ ...
, the first member of the Storwize family. Storwize uses the SAN Volume Controller code base with internal storage to provide a mid-price storage subsystem. The IBM Storwize V5000, V3700 and V3500 are shrunk compatible models with less cache/CPU/adapters and a reduced set of features. The IBM FlashSystem V9000 leverages the SVC firmware integrated with
IBM FlashSystem IBM FlashSystem is an IBM Storage enterprise system that stores data on flash memory. Unlike storage systems that use standard solid-state drives, IBM FlashSystem products incorporate custom hardware based on technology from the 2012 IBM acquis ...
solid-state drawers. In 2015, IBM re-badged the virtualization functionality as ''Spectrum Virtualize'', in order to align it with the IBM software-defined storage naming conventions and to highlight the interoperability aspect.


Non-IBM products running SVC code

The
Actifio Actifio was a privately held information technology firm headquartered in Waltham, Massachusetts. The company specialised in copy data virtualization for making information technology infrastructure more efficient by reducing unnecessary duplicati ...
Protection and Availability Storage (PAS) appliance includes elements of SVC code to achieve wide interoperability. The PAS platform spans backup, disaster recovery, and business continuity among other functions.


See also

* IBM Subsystem Device Driver (SDD), a multipathing driver for IBM System Storage, originally used by
IBM Enterprise Storage Server The IBM Enterprise Storage Server (ESS) or the ''Shark'' is an enterprise storage array from IBM. History Originally, in 1998 IBM released the IBM 2105 Versatile Storage Server (VSS). It did not meet commercial success and the successor came in ...
* EMC
VPLEX EMC VPLEX is a virtual computer data storage product introduced by EMC Corporation in May 2010. VPLEX implements a distributed "virtualization" layer within and across geographically disparate Fibre Channel storage area networks and data center ...
, a competing solution *
IBM Storwize family IBM Storwize systems were virtualizing RAID computer data storage systems with raw storage capacities up to 32 PB. Storwize is based on the same software as IBM SAN Volume Controller (SVC). Formerly Storwize was an independent data storage organ ...


Footnotes


References


External links


Links to SVC Redbooks, info, updates

Official IBM Storage Virtualization Page

Official IBM SAN Page

Official IBM Redbooks page for SVC

Official IBM Redbooks page for Storwize V7000
{{DEFAULTSORT:Ibm San Volume Controller SAN Volume Controller Storage software Storage virtualization